Introduction

In this study, we analysis the data of U.S. retail sales of cars from 1993 to Jul.2022.

  1. Log-transformation and first-order differencing are used to eliminate the trend and seasonal structure.

  2. Three multiplicative decomposition models are constructed.

     Model 1: Multiplicative Decomposition Model to Sales (1993-2017)
     Model 2: Multiplicative Decomposition Model (With Lag Residuals)
     Model 3: Multiplicative Decomposition Model to Log Return Sales
  3. Forecast data of sales from 2018 to 2022 by Model 1 is compared to the real data.

  4. Partial F-test is used to determine the necesserity of variables.

  5. Residual diagnostics are performed to compare the goodness of three models.

  6. Seasonal static index are calculated, tabulated, and interpreted.

  7. Model 1 and model 3’s prediction ability of static seasonal index are compared.

  8. Sales data from Mar.2020 to Jul.2022 during COVID-19 period are discussed.

Exploratory Data Analysis

The data gives monthly U.S. retail sales of cars, in millions of dollars, for the time period 1993 through Jul.2022.

Date Sales c348 s348 c432 s432 Changepoint obs106 fMonth
1993-01-01 25801 -0.5775727 0.8163393 -0.9101060 0.4143756 0 0 1
1993-02-01 26165 -0.3328195 -0.9429905 0.6565858 -0.7542514 0 0 2
1993-03-01 31277 0.9620277 0.2729519 -0.2850193 0.9585218 0 0 3
1993-04-01 32586 -0.7784623 0.6276914 -0.1377903 -0.9904614 0 0 4
1993-05-01 32786 -0.0627905 -0.9980267 0.5358268 0.8443279 0 0 5
1993-06-01 34497 0.8509945 0.5251746 -0.8375280 -0.5463943 0 0 6

For the span of the data there were three time periods judged to be contractions by the Business Cycle Dating Committee. They are Apr.2001 to Nov.2001, Jan.2008 to Jun.2009, and Mar.2020 to Apr.2020.

The first was caused by a drop in manufacturing, and perhaps was also a consequence of the 2000 dot com bubble. The second was the recession caused by a financial crisis which involved, among other problems, inflated real estate prices.The third, only two months in length, occurred at the onset of the COVID pandemic.

Time Series Plot: Sales vs. Time

New car sales in dollars show a steady upward trend from 1993 to 2000 and a leveling until 2007, followed by a sudden severe drop during the 2008–2009 recession. Beginning in the second half of 2009, sales move upward at a rate which is steep until 2016, and then the slope decreases somewhat. There is a severe drop during the downturn in March and April of 2020 as COVID arrives, and then there is noticeably increased sales during 2021 and 2022. There is no visible decrease in sales stemming from the 2001 recession. The long-run upward trend is fueled by several factors, including increase in population, price increases for new cars, and probably an increase in the number of multicar families. A strong rise in sales during 2021 is evident and a bit puzzling, because supply chain delays greatly affected the availability of new cars after COVID arrived, and many buyers switched to the purchase of used cars. There was some pent-up demand from very low sales in March and April of 2020, and this may have been a factor. It is also likely that there were price increases in the wake of the supply chain delays.

Another feature of the time series is a pronounced seasonal effect, with sales strongest in the spring and summer months. In addition, the plot shows increasing volatility as the level rises.

Log-transformation

The plot of the log of sales is similar to the previous plot, except that the pattern of the volatility is different. Here it is more pronounced in the early years, indicating that the log transformation is somewhat of an overcorrection. It is not clear from the two plots whether the changing volatility dictates the fitting of a multiplicative decomposition model, rather than an additive one.

First-order Differencing

As is common with time series, the log return transformation has removed much of the trend; a small amount remains for the years 2020–2022. The first two economic downturns have had some impact upon the log return. During 2001 there is increased volatility, and during the 2008–2009 recession there is actually decreased volatility. Moreover, the March to April downturn in 2020 produced disruption of the log return values.

Spectrum Analysis

The length of our TS data from 1993 to 2017 is 300, and half of the squared root of 300 is 8.66. Therefore, the span for spectrum analysis we choose is 9.

Overall, the spectral plot is not flat, we see strong seasonal structure and trend. From the red lines, which show the seasonal frequencies of monthly car logSales data, indicate a strong seasonal component. Blue lines, the calendar structure for the frequencies 0.348 and 0.432, shows that the trigonometric pair for 0.432 is insignificant.

We can see obvious bumps in frequencies 1/12, 2/12, 3/12, 4/12, and 5/12. This reveals a seasonal structure of the log car sales data. Twice of the blue line above the notch is much more smaller than the full extend of the spectrum, revealing that we have to reduced the white noise of the data in order to fit a good prediction model.

These features provide guidance for the construction of a multiplicative decomposition model. Also, we have to include trend components, seasonal components (‘fMonth’) and trigonometric pair (‘c348 and s348’)

Model Fit

Model 1: Multiplicative Decomposition Model to Sales (1993-2017)

There are about five turns in the graph, therefore, we choose a six-degree polynomial change multiplicative model. Also, we include the seasonal components, the interactions related to the sudden shift around year 2008, one pair of trigonometric variable for frequency 0.348.

## 
## Call:
## lm(formula = logSales ~ poly(Time, 6) + fMonth + Changepoint + 
##     Changepoint * poly(Time, 6) + c348 + s348 + obs106, data = carsales[1:300, 
##     ])
## 
## Residuals:
##       Min        1Q    Median        3Q       Max 
## -0.129741 -0.022424  0.000705  0.023176  0.153126 
## 
## Coefficients:
##                              Estimate Std. Error t value Pr(>|t|)    
## (Intercept)                 4.696e+00  1.785e+00   2.631 0.009011 ** 
## poly(Time, 6)1             -1.512e+02  4.525e+01  -3.341 0.000951 ***
## poly(Time, 6)2             -1.433e+02  4.132e+01  -3.468 0.000610 ***
## poly(Time, 6)3             -9.845e+01  2.836e+01  -3.472 0.000601 ***
## poly(Time, 6)4             -5.267e+01  1.468e+01  -3.588 0.000395 ***
## poly(Time, 6)5             -1.998e+01  5.360e+00  -3.728 0.000235 ***
## poly(Time, 6)6             -4.482e+00  1.112e+00  -4.031 7.21e-05 ***
## fMonth2                     3.801e-02  1.135e-02   3.350 0.000923 ***
## fMonth3                     2.005e-01  1.135e-02  17.656  < 2e-16 ***
## fMonth4                     1.460e-01  1.136e-02  12.856  < 2e-16 ***
## fMonth5                     1.961e-01  1.137e-02  17.244  < 2e-16 ***
## fMonth6                     1.773e-01  1.138e-02  15.580  < 2e-16 ***
## fMonth7                     1.858e-01  1.138e-02  16.317  < 2e-16 ***
## fMonth8                     2.076e-01  1.140e-02  18.204  < 2e-16 ***
## fMonth9                     1.003e-01  1.141e-02   8.791  < 2e-16 ***
## fMonth10                    8.907e-02  1.153e-02   7.723 2.19e-13 ***
## fMonth11                    3.043e-02  1.144e-02   2.659 0.008296 ** 
## fMonth12                    8.654e-02  1.145e-02   7.556 6.37e-13 ***
## Changepoint                 5.133e+02  8.025e+01   6.396 6.94e-10 ***
## c348                        6.592e-03  3.292e-03   2.002 0.046268 *  
## s348                        1.519e-02  3.287e-03   4.621 5.91e-06 ***
## obs106                      2.445e-01  4.150e-02   5.892 1.12e-08 ***
## poly(Time, 6)1:Changepoint -1.203e+04  1.944e+03  -6.187 2.24e-09 ***
## poly(Time, 6)2:Changepoint  1.011e+04  1.618e+03   6.246 1.61e-09 ***
## poly(Time, 6)3:Changepoint -5.656e+03  9.617e+02  -5.882 1.19e-08 ***
## poly(Time, 6)4:Changepoint  2.385e+03  4.058e+02   5.878 1.21e-08 ***
## poly(Time, 6)5:Changepoint -5.894e+02  1.119e+02  -5.266 2.83e-07 ***
## poly(Time, 6)6:Changepoint  8.350e+01  1.559e+01   5.357 1.80e-07 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 0.04008 on 272 degrees of freedom
## Multiple R-squared:  0.9724, Adjusted R-squared:  0.9697 
## F-statistic: 354.9 on 27 and 272 DF,  p-value: < 2.2e-16

The adjusted R-squared is 0.9697, the residual standard error is 0.04008. The trend structure components and seasonal components are significant.

The trend structure is overall not effective because there are a few significant outliers that need to be carefully considered (although we specifically considered the change in 2008). Also, the residual spectral plot suggests that residuals of model 3 are not following white noise structure.

To separate the model into two parts (before and after year 2008) is necessary because all the interactions and Changepoint variables have very small p-values much less than 0.05 (alpha level). This indicates all these variables are significant.

From the summary statistics, the trends given by these two models:

Log sales before the Changepoint/2008:

\[Sales_{Before2008} = exp(4.696-151.2t-142.3t^2-98.45t^3-52.67t^4-19.98t^5-4.482t^6)\] Log sales after the Changepoint/2008:

\[Sales_{After2008} = exp(517.996-12025.304t+9958.8t^2-5798.3t^3+2332.33t^4-609.38t^5+79.018t^6)\]

Static Seasonal Index

Static Seasonal Index (Model 1)
Jan 0.8856
Feb 0.9199
Mar 1.0822
Apr 1.0248
May 1.0775
Jun 1.0574
Jul 1.0664
Aug 1.0899
Sep 0.9790
Oct 0.9681
Nov 0.9130
Dec 0.9657

The static seasonal index table indicates that,

In Jan, the car sales is estimated to be 11.44 percent below the level of the trend.
In Feb, the car sales is estimated to be 8.01 percent below the level of the trend.
In Mar, the car sales is estimated to be 8.22 percent above the level of the trend.
In Apr, the car sales is estimated to be 2.48 percent above the level of the trend.
In May, the car sales is estimated to be 7.75 percent above the level of the trend.
In Jun, the car sales is estimated to be 5.74 percent above the level of the trend.
In Jul, the car sales is estimated to be 6.64 percent above the level of the trend.
In Aug, the car sales is estimated to be 8.99 percent above the level of the trend.
In Sep, the car sales is estimated to be 2.10 percent below the level of the trend.
In Oct, the car sales is estimated to be 3.19 percent below the level of the trend.
In Nov, the car sales is estimated to be 8.70 percent below the level of the trend.
In Dec, the car sales is estimated to be 3.43 percent below the level of the trend.

The month of highest car sales is August and the month of lowest car sales is January. The Estimated Static Seasonal Index Plot also shows a stable high level of car retails from March to August following by an obvious decrease in September and a recovery in February of the next year.

Residual Diagnostics

## 
##  Shapiro-Wilk normality test
## 
## data:  res3
## W = 0.9818, p-value = 0.0007343

Most points in Normal Q-Q plot follows the theoretical quantile line but there are slight deviation with a few significant outliers near both ends of the line. Low sale values are overestimated and high sale values are underestimated by model 1. Also, the p-value of Shapiro-Wilk normality test is less than 0.05, we are unable to prove that the residuals are normally distributed. Both results indicate that the residuals of model 3 are not perfectly follows normal distribution.

## 
##  Durbin-Watson test
## 
## data:  model3
## DW = 1.409, p-value = 6.933e-09
## alternative hypothesis: true autocorrelation is greater than 0
## 
##  Shapiro-Wilk normality test
## 
## data:  resid(model3)
## W = 0.9818, p-value = 0.0007343

The Residuals vs. Time plot shows a few outliers around year 2000, 2005 and 2009. The residuals fluctuates around a mean of zero. The variance has strong variation, revealing uncaptured trend structure in Model 1, especially for the contraction period. The lags 1, 6, 12, 18 and 36 exceed the blue lines in the auto-correlation plot, indicating that there are uncaptured variations in model 1. The data are not reduced to white noise perfectly. The Durbin-Watson test value is 1.409 which also shows a positive autocorrelation among the residuals. The Shapiro-Wilk test rejects the hypothesis that the residuals follow a normal distribution. This is perhaps primarily attributable to three observations which the model underpredicts, at February 2000, July 2005, and August 2009. Apart from these three, the rest show good agreement with normality.

The spectral graph is not flat as we need, indicating that there exists white noise in model 1 (the residuals do not follow the white noise structure). Double of the blue segment above the notch is shorter than the full extent of the spectrum, also indicates that we have not successfully reduced the white noise in the model. Moreover, we see peaks in frequencies 1/12, 3/12, and 5/12, indicating that there is uncaptured seasonal structure. There is no peak of 0.348 and 0.432 as we expected. Model 1 captures the calendar structure well.

Forecast Sales for 2018–2019

Forecast v.s. Actual Sales (Model 1)
Time Forecast Actual
2018.1 71037.90 67831
2018.2 77503.40 67580
2018.3 91884.02 83004
2018.4 88246.30 74537
2018.5 98381.04 82287
2018.6 97751.72 78447
2018.7 101985.95 78402
2018.8 111703.45 82388
2018.9 102590.43 73247
2018.10 107491.64 75864
2018.11 110004.83 73766
2018.12 120881.73 80973
2019.1 120812.92 67212
2019.2 138332.20 67726
2019.3 173016.97 82765
2019.4 184436.65 77114
2019.5 218075.48 84145
2019.6 234730.68 78334
2019.7 276544.54 82621
2019.8 326157.15 88042
2019.9 334430.99 75963
2019.10 403012.62 79167
2019.11 453337.69 78006
2019.12 575100.66 81741

By comparing the predicted sales data with the actual data, we can find that the prediction of model 1 is not accurate, and the predicted data is higher than the actual data. In the early period (periods 1 to 10) the prediction is relatively close and the prediction of fluctuation is more accurate (the blue line of patterning is more similar to the red line). However, the further back in time, the more the predicted data deviate from the actual values. It is also possible that the deviation from the forecast data is increasing exponentially because we are using log sales data in the fit model, but all in all, model 1 is not a good predictor of future sales.

Model 2: Multiplicative Decomposition Model (With Lag Residuals)

The lag 1 residuals have been added to model 1 to perform model 2.

## 
## Call:
## lm(formula = logSales ~ poly(Time, 6) + fMonth + Changepoint + 
##     Changepoint * poly(Time, 6) + c348 + s348 + obs106 + lag1resid, 
##     data = carsales[1:300, ])
## 
## Residuals:
##       Min        1Q    Median        3Q       Max 
## -0.118248 -0.023293  0.003412  0.021817  0.128565 
## 
## Coefficients:
##                              Estimate Std. Error t value Pr(>|t|)    
## (Intercept)                 4.744e+00  1.708e+00   2.777 0.005867 ** 
## poly(Time, 6)1             -1.500e+02  4.330e+01  -3.464 0.000619 ***
## poly(Time, 6)2             -1.422e+02  3.954e+01  -3.596 0.000384 ***
## poly(Time, 6)3             -9.768e+01  2.713e+01  -3.600 0.000378 ***
## poly(Time, 6)4             -5.226e+01  1.405e+01  -3.720 0.000242 ***
## poly(Time, 6)5             -1.982e+01  5.129e+00  -3.865 0.000139 ***
## poly(Time, 6)6             -4.447e+00  1.064e+00  -4.180 3.94e-05 ***
## fMonth2                     3.769e-02  1.086e-02   3.471 0.000603 ***
## fMonth3                     2.001e-01  1.086e-02  18.423  < 2e-16 ***
## fMonth4                     1.456e-01  1.086e-02  13.404  < 2e-16 ***
## fMonth5                     1.958e-01  1.088e-02  17.991  < 2e-16 ***
## fMonth6                     1.769e-01  1.089e-02  16.250  < 2e-16 ***
## fMonth7                     1.854e-01  1.089e-02  17.018  < 2e-16 ***
## fMonth8                     2.072e-01  1.091e-02  18.991  < 2e-16 ***
## fMonth9                     9.989e-02  1.092e-02   9.150  < 2e-16 ***
## fMonth10                    8.782e-02  1.104e-02   7.956 4.85e-14 ***
## fMonth11                    2.998e-02  1.095e-02   2.738 0.006585 ** 
## fMonth12                    8.605e-02  1.096e-02   7.852 9.61e-14 ***
## Changepoint                 5.175e+02  7.679e+01   6.739 9.55e-11 ***
## c348                        6.494e-03  3.150e-03   2.061 0.040232 *  
## s348                        1.520e-02  3.145e-03   4.834 2.24e-06 ***
## obs106                      2.649e-01  3.991e-02   6.638 1.72e-10 ***
## lag1resid                   2.983e-01  5.839e-02   5.108 6.15e-07 ***
## poly(Time, 6)1:Changepoint -1.214e+04  1.860e+03  -6.523 3.36e-10 ***
## poly(Time, 6)2:Changepoint  1.020e+04  1.549e+03   6.584 2.36e-10 ***
## poly(Time, 6)3:Changepoint -5.711e+03  9.202e+02  -6.206 2.02e-09 ***
## poly(Time, 6)4:Changepoint  2.409e+03  3.883e+02   6.203 2.07e-09 ***
## poly(Time, 6)5:Changepoint -5.964e+02  1.071e+02  -5.569 6.16e-08 ***
## poly(Time, 6)6:Changepoint  8.451e+01  1.492e+01   5.665 3.74e-08 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 0.03835 on 271 degrees of freedom
## Multiple R-squared:  0.9748, Adjusted R-squared:  0.9722 
## F-statistic: 374.7 on 28 and 271 DF,  p-value: < 2.2e-16

All the variables in model five have p-values less than 0.05 (alpha level), indicating these variables are significant and necessary to be included in the multiplicative model. The residual standard error of model 2 is 0.03835 which is slightly smaller than model 1. The adjusted R-squared is 0.9722 which is slightly higher than model 1. These indicate that model 2 fits better than model 1 but the improvement is not very significant.

Residual Diagnostics

## 
##  Shapiro-Wilk normality test
## 
## data:  res5
## W = 0.98631, p-value = 0.005992

The Normal Q-Q plot shows a few acceptable outliers near both ends of the theoretical normal line, but the remaining part fits better than model 1. This indicates the residual of model 5 follows the normal distribution better than model 1. The p-value of Shapiro-Wilk normality test of model 2 is slightly greater than that of model 1, although it is still less than 0.05.

After including lag 1 residuals variable, we can observe significant residuals around year 2001, 2007 and 2009 in residuals vs. time plot. Model 2 is still able to capture the trend structure during economic contraction period fully The variance is obviously not constant, but it is slightly better than residual variance of model 1. Also, lag 6, 12, and 18 exceed the blue line reveals that there exists auto-correlation among residuals. Therefore, the homoscedasticity assumption of residuals is violated. However, comparing to model 1, we can obviously see that lag 1 of the ACF plot becomes insignificant. This indicates that the data of model 2 is getting closer to the white noise structure.

## logical(0)

The peaks at frequencies 3/12 and 5/12 show the remaining white noise in model 2. Comparing to model 1, the effect of white noise at frequency 1/12 is eliminated. In this regard, the residual of model 2 is closer to the structure of white noise than model 1. (The blue line on the top right corner is shorter than this of model 1, but the reason might because it is more extended at point 0. Except for the zero point, the pattern is in the range of 5e-04 to 5e-03, and the scale is the same as model 1.)

From the residual vs. predictions plot, we see that points cluster in the middle of the graph. We expect a tendency to be more concentrated on the left and more dispersed on the right. Therefore, the result proves that the residual of the model not satisfy constant variance assumption.

Model 3: Multiplicative Decomposition Model to Log Return Sales

We include the seasonal component ‘fMonth’, the calendar structure variables ‘c348, s348, c432, s432’ and a dummy variable for the outlier ‘obs106’ in model 3.

## 
## Call:
## lm(formula = dlogSales ~ fMonth + c348 + s348 + c432 + s432 + 
##     obs106, data = carsales[1:300, ])
## 
## Residuals:
##       Min        1Q    Median        3Q       Max 
## -0.215434 -0.023945  0.001908  0.030891  0.142825 
## 
## Coefficients:
##              Estimate Std. Error t value Pr(>|t|)    
## (Intercept) -0.076822   0.009504  -8.083 1.85e-14 ***
## fMonth2      0.116239   0.013449   8.643 4.11e-16 ***
## fMonth3      0.242316   0.013439  18.031  < 2e-16 ***
## fMonth4      0.024307   0.013436   1.809   0.0715 .  
## fMonth5      0.130226   0.013446   9.685  < 2e-16 ***
## fMonth6      0.060809   0.013438   4.525 8.89e-06 ***
## fMonth7      0.088561   0.013440   6.589 2.15e-10 ***
## fMonth8      0.102198   0.013440   7.604 4.26e-13 ***
## fMonth9     -0.027245   0.013444  -2.027   0.0436 *  
## fMonth10     0.067573   0.013574   4.978 1.12e-06 ***
## fMonth11     0.011536   0.013449   0.858   0.3917    
## fMonth12     0.137633   0.013437  10.243  < 2e-16 ***
## c348         0.023072   0.003896   5.921 9.24e-09 ***
## s348         0.016979   0.003892   4.363 1.80e-05 ***
## c432         0.009254   0.003898   2.374   0.0183 *  
## s432        -0.001992   0.003888  -0.512   0.6088    
## obs106       0.302467   0.048809   6.197 2.03e-09 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 0.04748 on 283 degrees of freedom
## Multiple R-squared:  0.7314, Adjusted R-squared:  0.7162 
## F-statistic: 48.16 on 16 and 283 DF,  p-value: < 2.2e-16

The residual standard error is 0.04748. The adjusted R-squared is 0.7162.

Significance of Trend Variables: Partial F-test

## Analysis of Variance Table
## 
## Model 1: dlogSales ~ fMonth + Changepoint + c348 + s348 + c432 + s432 + 
##     obs106
## Model 2: dlogSales ~ poly(Time, 6) + fMonth + Changepoint + c348 + s348 + 
##     c432 + s432 + obs106
##   Res.Df     RSS Df Sum of Sq      F Pr(>F)
## 1    282 0.63789                           
## 2    276 0.62339  6  0.014497 1.0697 0.3809

By comparing the model with and without the trend component, the p-value for the partial F-test is 0.3809. This reveals the trend component is insignificant. We are going to remove the polynomial trend variable. Because performing differencing operation on log sales data has already eliminate most trend structure, the trend component seems not necessarily to be included.

Significance of Changepoint Variables: Partial F-test

## Analysis of Variance Table
## 
## Model 1: dlogSales ~ fMonth + c348 + s348 + c432 + s432 + obs106
## Model 2: dlogSales ~ fMonth + Changepoint + c348 + s348 + c432 + s432 + 
##     obs106
##   Res.Df     RSS Df  Sum of Sq      F Pr(>F)
## 1    283 0.63795                            
## 2    282 0.63789  1 5.7554e-05 0.0254 0.8734

The p-value for partial F-test related to “Changepoint” variable is 0.8734, indicating that the variable is insignificant.

Residual Diagnostics

## 
##  Shapiro-Wilk normality test
## 
## data:  res6
## W = 0.97204, p-value = 1.403e-05

There is an obvious deviation in the left side of the normal quantile plot with a few signicicant outliers. This indicates uncaptured trend of model 3. Low sale values are slightly overestimated by the model. Also, the p-value of Shapiro-Wilk normality test is 1.403e-05, even smaller than model 1 and model 2. Therefore, the residuals of model 3 fail to follow a normal distribution.

The residuals vs. time plot shows no improvement of capturing the unusual structure of economic downturn periods (comparing to model 1 and model 2). Also, lag 1, 2, 6, 10, 12, 18, and 36 on the ACF plot exceed the blue line, indicating a more auto-correlated residuals of model 3 than these of model 1 and model 2. Lag 6, 12, 18, and 36 are more significant indicates that there is still dynamic seasonal structure. Moreover, the log transformation and differencing operation are not able to deal with the Great Recession effect in 2009.

The peak around frequency 3/12 reveals remaining trend structure, and the peak at frequency 5/12 reveals remaining dynamic seasonal structure. The blue dash lines show that no calendar structure remains.

Static seasonal Index Comparison (Model 1, Model 3)

Static Seasonal Index Comparison
Model 1 Model 3
Jan 0.8856260 0.8871437
Feb 0.9199385 0.9203306
Mar 1.0821948 1.0830498
Apr 1.0248197 1.0248797
May 1.0775197 1.0781952
Jun 1.0574240 1.0582167
Jul 1.0664025 1.0678349
Aug 1.0899481 1.0923362
Sep 0.9790497 0.9817295
Oct 0.9681302 0.9700772
Nov 0.9129858 0.9063259
Dec 0.9656817 0.9605631
plot(ts(expseas3), col="blue", ylab="seasonal index",xlab="month", main = "Estimated Static Seasonal Index Plot ")
lines(ts(s), col="red")
legend(1,1.3, legend=c("Model 1","Model 3"), col=c("blue", "red"), pch=4, ncol=1)

The table and the plot below show that the static seasonal index estimates from model 1 and model 3 are almost identical. Although model 1 is not adequately to estimate the trend structure in the following 24 months, it still has the ability to estimate the static seasonal structure.

Discussion

Period from Mar.2020 to Jul.2022

In January and February of 2020 cases of COVID began to appear, and the illness started to dominate news reports. It wasn’t until March 2020 that COVID produced a strong effect on the U.S. economy, giving a two-month economic downturn. Economic recovery began to take effect in May 2020.

The effect of the short recession on new car sales was very strong. Given the fear of contracting COVID, some potential buyers were reluctant to shop in automotive dealer showrooms. But more than that, there was a shortage of new cars available for purchase because of supply chain disruptions that affected automotive parts required for manufacturing, and then, of course, many potential customers are reluctant to take on large dollar purchases when there is economic uncertainty and pressure during a recession. The plots in part 1 show a dramatic drop of new car sales during March and April of 2020. These months are followed by a more typical level of sales in the remainder of 2020. In 2021 sales increase considerably, perhaps for two reasons. One is the availability of COVID vaccines, and better understanding of how COVID was spreading and how to be cautious. Second, there was some easing of the supply chain delays. And third, the low sales during the recession and somewhat low sales in the remainder of 2020 led to a lot of pent-up demand. It is true that many buyers were switching to purchase of used cars in 2020, but these were experiencing considerable price rises because of the great demand. It made sense that buyers could resume purchase of new cars in 2021. The plots also show that dollars spent on purchases lowered somewhat to a more steady trajectory in 2022 after the steep rise in 2021.

Discussion and Speculation

During Covid-19, there was a significant drop in car sales. Unlike the impact of the Great Recession in 2008, the magnitude of the impact of Covid-19 on U.S. car sales was more dramatic, more abrupt, and shorter in duration. Another difference is that the Great Recession directly affected the overall car sales market trend (the intercept and slope of the time series plot) and took a long time to recover. Covid-19 temporarily reduced car sales but was followed by a sharp increase. The possible reason for this is that the Great Depression had an impact on people’s income and economic situation, making people reluctant to spend on cars. And the sudden drop of sales during covid-19 period may be temporary due to the country’s policy of entry and exit and import/export, resulting in people having money but not being able to buy cars. So after a period of time, the policy was slowly restored and 1) a group of people who were prepared to buy cars before concentrated on spending during the period after the restoration, and 2) there may have been some retaliatory spending.

Remarks

  1. The multiplicative decomposition model is suitable for the data.
  2. The prediction of model 1 is not accurate, and the predicted data is higher than the actual data.
  3. The month of highest car sales is August and the month of lowest car sales is January. There is a stable high level of car retails from March to August following by an obvious decrease in September and a recovery in February of the next year.
  4. Although model 1 is not adequately to estimate the trend structure in the following 24 months, it still has the ability to estimate the static seasonal structure.
  5. During Covid-19, there was a significant drop in car sales. Unlike the impact of the Great Recession in 2008, the magnitude of the impact of Covid-19 on U.S. car sales was more dramatic, more abrupt, and shorter in duration. Another difference is that the Great Recession directly affected the overall car sales market trend and took a long time to recover.

Reference

  1. A timeline of covid-19 developments in 2020. AJMC. (n.d.). Retrieved October 17, 2022, from https://www.ajmc.com/view/a-timeline-of-covid19-developments-in-2020

  2. President, J. C. V., Cusick, J., President; Roberts, W. (2022, January 19). 2021 was a year of bold economic policy that must be extended. Center for American Progress. Retrieved October 18, 2022, from https://www.americanprogress.org/article/2021-was-a-year-of-bold-economic-policy-that-must-be-extended/